CLAIMS 

Please amend the following claims. 




1. (Currently Amended) A method for performing a gather operation on a 




computer processor comprising: 

computing addresses for a plurality of data elements of a matrix stored in memory 
utilizin g a plurality onindices and a base address ; 

retrieving each o*(said data elements from memory based on the computed 
addresses; and 

executing a plurality \rf instructions, each instruction depositing one or more of 
said data elements contiguously with other data elements in a storage location. 

2. (Original) The method\is in claim 1 wherein said storage locations are 
registers. 



and 



3. (Original) The method as in>claim 1 wherein computing addresses comprises: 
extracting indices for each of said Hata elements into separate storage locations; 

adding each of said indices to a base address. 



4. (Previously Amended) The method afc in claim 1 further comprising: 
loading each of said data elements from memory into separate storage locations 
prior to executing said plurality of instructions. 



5. (Previously Amended) The method as in claim 1 wherein said computer 
processor executes two or more of said instructions in a siiigle clock cycle. 
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(Original) The method as in claim 1 further comprising: 
ng each of said data elements on a mass storage device. 

7. (Original) The method as in claim 2 wherein said registers are 64-bits wide 
and said data elements are 16-bits in length. 

8. (CurrentR Amended) A method for performing a scatter operation on a 
computer processor comprising: 

calculating addresses in memory to which a plurality of data elements are to be 
scattered to form a matrix inVnemory utilizing a plurality of indices and a base address : 

executing a plurality obstructions, each of said instructions extracting one or 
more of said data elements fromV storage location in which said data elements are stored 
contiguously; and \ 

storing said data elements to\aid addresses in memory. 

9. (Original) The method as inVlaim 8 wherein said storage location is a 
register. \ 

10. (Previously Amended) The methoM as in claim 8 wherein calculating 
addresses comprises: \ 

extracting indices for each of said data elements into separate storage locations; 
and \ 
adding each of said indices to a base address. \ 
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I 1 K (Previously Amended) The method as in claim 8 wherein storing each of said 
data elements is accomplished via a plurality of STORE instructions executed by said 
computer processor. 

12. (Previously Amended) The method as in claim 8 wherein said computer 
processor executes two or more of said instructions in a single clock cycle. 

13. (Original) TheViethod as in claim 9 wherein said register is 64-bits wide and 
said data elements are 16-bitsvin length. 



14. (Currently Amended)\\ computer system comprising: 
a memory; 

a processor communicatively doupled to the memory; and 

a storage device communicativelV coupled to the processor and having stored 
therein a sequence of instructions which, w^ien executed by the processor, causes the 
processor to at least, 

compute addresses for a plurality of dala elements of a matrix stored in memory 
utilizing a plurality of indices and a base address\ 

retrieve each of said data elements from memory based on the computed 
addresses and 

execute a plurality of instructions, each instruction to deposit one or more of said 
data elements contiguously with other data elements in a storage location. 

\ 

15. (Original) The computer system as in claim 14 wherein said storage locations 
are registers. 
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^ / 16. ((Original) The computer system as in claim 14 wherein, responsive to one or 
more instructions in said sequence, said processor computes addresses by: 

extracting indices for each of said data elements into separate storage locations; 

and \ 

adding each of said indices to a base address. 

17. (Previously Amended) The computer system as in claim 14 wherein said 
processor loads each of saioydata elements from memory into separate storage locations 
prior to executing said plurality of DEPOSIT instructions. 

18. (Previously Amended The computer system as in claim 17 wherein said 
processor executes two or more of said instructions in a single clock cycle. 

19. (Original) The computer system as in claim 14 wherein, responsive to one or 
more instructions in said sequence, said processor further: 

stores each of said data elements on said mass storage device. 

20. (Original) The computer system ak in claim 15 wherein said registers are 64-bits 
wide and said data elements are 16-bits in lengtrk 

21. (Previously Added) A method as in claim 1 wherein computing addresses 
comprises: \ 

executing a series of instructions, each instructioia to extract an address index for one 
of said plurality of data elements. \ 
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CyJ^ 22. (Original) The method as in claim 21 wherein said address indices are extracted 

from a series of Contiguous memory locations 

23. (OriginalVrhe method as in claim 21 wherein computing addresses comprises: 
adding each of said address indices to a base address. 
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